Search CORE

70 research outputs found

次世代動画像符号化アルゴリズムに関する研究

Author: Jiang Xiantao
Publication venue
Publication date: 30/10/2017
Field of study

Tokushima University Institutional Repository

Maximum-Entropy-Model-Enabled Complexity Reduction Algorithm in Modern Video Coding Standards

Author: Jiang Xiantao
Katayama Takafumi
Song Tian
Publication venue: 'MDPI AG'
Publication date: 21/05/2021
Field of study

Symmetry considerations play a key role in modern science, and any differentiable symmetry of the action of a physical system has a corresponding conservation law. Symmetry may be regarded as reduction of Entropy. This work focuses on reducing the computational complexity of modern video coding standards by using the maximum entropy principle. The high computational complexity of the coding unit (CU) size decision in modern video coding standards is a critical challenge for real-time applications. This problem is solved in a novel approach considering CU termination, skip, and normal decisions as three-class making problems. The maximum entropy model (MEM) is formulated to the CU size decision problem, which can optimize the conditional entropy; the improved iterative scaling (IIS) algorithm is used to solve this optimization problem. The classification features consist of the spatio-temporal information of the CU, including the rate–distortion (RD) cost, coded block flag (CBF), and depth. For the case analysis, the proposed method is based on High Efficiency Video Coding (H.265/HEVC) standards. The experimental results demonstrate that the proposed method can reduce the computational complexity of the H.265/HEVC encoder significantly. Compared with the H.265/HEVC reference model, the proposed method can reduce the average encoding time by 53.27% and 56.36% under low delay and random access configurations, while Bjontegaard Delta Bit Rates (BD-BRs) are 0.72% and 0.93% on average

Tokushima University Institutional Repository

Low-Complexity and Hardware-Friendly H.265/HEVC Encoder for Vehicular Ad-Hoc Networks

Author: Feng Jie
Jiang Xiantao
Katayama Takafumi
Song Tian
Publication venue: 'MDPI AG'
Publication date: 26/08/2021
Field of study

Real-time video streaming over vehicular ad-hoc networks (VANETs) has been considered as a critical challenge for road safety applications. The purpose of this paper is to reduce the computation complexity of high efficiency video coding (HEVC) encoder for VANETs. Based on a novel spatiotemporal neighborhood set, firstly the coding tree unit depth decision algorithm is presented by controlling the depth search range. Secondly, a Bayesian classifier is used for the prediction unit decision for inter-prediction, and prior probability value is calculated by Gibbs Random Field model. Simulation results show that the overall algorithm can significantly reduce encoding time with a reasonably low loss in encoding efficiency. Compared to HEVC reference software HM16.0, the encoding time is reduced by up to 63.96%, while the Bjontegaard delta bit-rate is increased by only 0.76–0.80% on average. Moreover, the proposed HEVC encoder is low-complexity and hardware-friendly for video codecs that reside on mobile vehicles for VANETs

Tokushima University Institutional Repository

Spatial Correlation-Based Motion-Vector Prediction for Video-Coding Efficiency Improvement

Author: Jiang Xiantao
Katayama Takafumi
Leu Jenq-Shiou
Song Tian
Publication venue: 'MDPI AG'
Publication date: 21/05/2021
Field of study

H.265/HEVC achieves an average bitrate reduction of 50% for fixed video quality compared with the H.264/AVC standard, while computation complexity is significantly increased. The purpose of this work is to improve coding efficiency for the next-generation video-coding standards. Therefore, by developing a novel spatial neighborhood subset, efficient spatial correlation-based motion vector prediction (MVP) with the coding-unit (CU) depth-prediction algorithm is proposed to improve coding efficiency. Firstly, by exploiting the reliability of neighboring candidate motion vectors (MVs), the spatial-candidate MVs are used to determine the optimized MVP for motion-data coding. Secondly, the spatial correlation-based coding-unit depth-prediction is presented to achieve a better trade-off between coding efficiency and computation complexity for interprediction. This approach can satisfy an extreme requirement of high coding efficiency with not-high requirements for real-time processing. The simulation results demonstrate that overall bitrates can be reduced, on average, by 5.35%, up to 9.89% compared with H.265/HEVC reference software in terms of the Bjontegaard Metric

Tokushima University Institutional Repository

Quality-Oriented Perceptual HEVC Based on the Spatiotemporal Saliency Detection Model

Author: Jiang Xiantao
Katayama Takafumi
Song Tian
Wang Lu
Zhu Daqi
Publication venue: 'MDPI AG'
Publication date: 18/12/2020
Field of study

Perceptual video coding (PVC) can provide a lower bitrate with the same visual quality compared with traditional H.265/high efficiency video coding (HEVC). In this work, a novel H.265/HEVC-compliant PVC framework is proposed based on the video saliency model. Firstly, both an effective and efficient spatiotemporal saliency model is used to generate a video saliency map. Secondly, a perceptual coding scheme is developed based on the saliency map. A saliency-based quantization control algorithm is proposed to reduce the bitrate. Finally, the simulation results demonstrate that the proposed perceptual coding scheme shows its superiority in objective and subjective tests, achieving up to a 9.46% bitrate reduction with negligible subjective and objective quality loss. The advantage of the proposed method is the high quality adapted for a high-definition video application

Tokushima University Institutional Repository

Domain Adaptation through Photorealistic Enhanced Images for Semantic Segmentation

Author: Jiang Xiantao
Katayama Takafumi
Leu Jenq-Shiou
Shimamoto Takashi
Song Tian
Publication venue: 'Hindawi Limited'
Publication date: 07/02/2023
Field of study

In this paper, three types of domain adaptation which are defined as image-level domain adaptation, interdomain adaptation, and intradomain adaptation are efficiently combined to construct a high efficiency framework for semantic segmentation. The proposed domain adaptation platform can achieve a high reduction of time-consuming to generate exhausted supervised data in the real world using photorealistic images. The proposed framework achieved a mean Intersection-over-Union (mIoU) of 45.0%. Furthermore, by combining the proposed method with intradomain adaptation, the improvement of 1.2% mIoU is achieved compared to previous work

Tokushima University Institutional Repository

Saliency-Enabled Coding Unit Partitioning and Quantization Control for Versatile Video Coding

Author: Jiang Xiantao
Jin Jiayuan
Li Wei
Song Tian
Yu Fei Richard
Publication venue: 'MDPI AG'
Publication date: 01/08/2022
Field of study

The latest video coding standard, versatile video coding (VVC), has greatly improved coding efficiency over its predecessor standard high efficiency video coding (HEVC), but at the expense of sharply increased complexity. In the context of perceptual video coding (PVC), the visual saliency model that utilizes the characteristics of the human visual system to improve coding efficiency has become a reliable method due to advances in computer performance and visual algorithms. In this paper, a novel VVC optimization scheme compliant PVC framework is proposed, which consists of fast coding unit (CU) partition algorithm and quantization control algorithm. Firstly, based on the visual saliency model, we proposed a fast CU division scheme, including the redetermination of the CU division depth by calculating Scharr operator and variance, as well as the executive decision for intra sub-partitions (ISP), to reduce the coding complexity. Secondly, a quantization control algorithm is proposed by adjusting the quantization parameter based on multi-level classification of saliency values at the CU level to reduce the bitrate. In comparison with the reference model, experimental results indicate that the proposed method can reduce about 47.19% computational complexity and achieve a bitrate saving of 3.68% on average. Meanwhile, the proposed algorithm has reasonable peak signal-to-noise ratio losses and nearly the same subjective perceptual quality

Directory of Open Access Journals

Tokushima University Institutional Repository

DRMS for Patient-Level Lymph Node Status Classification

Author: Jiang Xiantao
Katayama Takafumi
Leu Jenq-Shiou
Shimamoto Takashi
Song Tian
Wang Lu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/03/2022
Field of study

Generally, automatic diagnosis of the presence of metastases in lymph nodes has therapeutic implications for breast cancer patients. Detection and classification of breast cancer metastases have high clinical relevance, especially in whole-slide images of histological lymph node sections. Fast early detection leads to huge improvement of patient’s survival rate. However, currently pathologists mainly detect the metastases with microscopic assessments. This diagnosis procedure is extremely laborious and prone to inevitable missed diagnoses. Therefore, automated, accurate patient-level classification would hold great promise to reduce the pathologist’s workload while also reduce the subjectivity of diagnosis. In this paper, we provide a novel deep regional metastases segmentation (DRMS) framework for the patient-level lymph node status classification. First, a deep segmentation network (DSNet) is proposed to detect the regional metastases in patch-level. Then, we adopt the density-based spatial clustering of applications with noise (DBSCAN) to predict the whole metastases from individual slides. Finally, we determine patient-level pN-stages by aggregating each individual slide-level prediction. In combination with the above techniques, the framework can make better use of the multi-grained information in histological lymph node section of whole-slice images. Experiments on large-scale clinical datasets (e.g., CAMELYON17) demonstrate that our method delivers advanced performance and provides consistent and accurate metastasis detection in clinical trials

Tokushima University Institutional Repository

Reporting quality and statistical analysis of published dose-response meta-analyses was suboptimal: A cross-sectional literature survey

Author: Cao Shiyi
Chen Fan
Jiang Qingqing
Liu Qiaoyan
Lu Zuxun
Song Fujian
Zeng Xiantao
Publication venue: 'Elsevier BV'
Publication date: 01/11/2019
Field of study

Objective To investigate the characteristics, methodological quality, and reporting of statistical analyses of published dose-response meta-analyses (DRMAs). Study Design and Setting We searched PubMed to identify DRMAs published in 2017. The reporting characteristics and methodological qualities were assessed by the PRISMA (27 items) and AMSTAR (11 items) respectively. We also summarized the reporting of statistical analyses of included DRMAs. Results We identified 93 DRMAs, most of which (59/93) were conducted by Chinese researchers, the main outcome was the incidence of cancers. Of the PRISMA and AMSTAR items, twenty and five were well complied (80% or more) respectively. The compliance rates of several PRISMA checklist items, such as structured summary, objectives, protocol and registration, and funding, were less than 50%. There were no criteria to estimate the doses for the open-ended intervals of exposure or intervention doses. When the restricted cubic splines were used to fit nonlinear dose-response relationships, there were also no criteria to determine the fixed knots. Conclusion The adherence to the methodological items of reporting guidelines and statistical analysis of published DRMAs were suboptimal. Development of reporting guidelines to assist authors in writing and readers in critically appraising the reports of DRMAs is timely

University of East Anglia digital repository